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Abstract 

Let X\ , X2 , • • • , -Xn , • • • be a sequence of iid random variables with 
H ■ values in a finite alphabet {1, . . . ,m}. Let LI n be the length of the 

longest increasing subsequence of X\ , X2 , . . . , X n . We express the lim- 
\q [ iting distribution of LI n as functionals of m and (m — l)-dimensional 

CO ' Brownian motions. These expressions are then related to similar func- 

tionals appearing in queueing theory, allowing us to further establish 
\q , asymptotic behaviors as m grows. The finite alphabet results are then 

used to treat the countable (infinite) alphabet. 
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X: 

S ; 1 Introduction 

The pursuit of a robust understanding of the asymptotics of the length of the 
longest increasing subsequence La n of a random permutation of length n - 
often known as " Ulam's Problem" - has given rise to a remarkable collection 
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of results in recent years. The work of Logan and Shepp [20], and Vershik 
and Kerov [25], first showed that ELa n / y/n — ► 2. Following this fundamental 
asymptotic result, Baik, Deift, and Johansson, in their landmark paper [3], 
determined the limiting distribution of La n , properly centered and normal- 
ized. This problem has emerged as a nexus of once seemingly unconnected 
mathematical ideas. Indeed, the latter paper is, in particular, quite remark- 
able for the sheer breadth of mathematical machinery required, machinery 
calling upon an understanding of random matrix theory, the asymptotics 
of Toeplitz operators, Riemann-Hilbert Theory, as well as the Robinson- 
Schensted-Knuth correspondence, to obtain the limiting Tracy- Widom dis- 
tribution. 

Initial approaches to the problem relied heavily on combinatorial argu- 
ments. Most work of the last decade, however, such as that of Aldous and 
Diaconis [1] and Seppalainen [25], have instead used interacting particle pro- 
cesses and so-called "hydrodynamical arguments" to show that Lo n j \fn — ► 2 
in expectation and in probability. Building on these ideas, Groeneboom [TT] 
proves such convergence results using only the convergence of random signed 
measures, while Cator and Groeneboom [5] prove that ELcr ra / 'y/n — > 2 in a 
way that avoids both ergodic decomposition arguments and the subadditive 
ergodic theorem. Aldous and Diaconis |2J also connect these particle process 
concepts to the card game patience sorting. Finally, Seppalainen [26] employs 
these particle processes to a verify an open asymptotics problem in queueing 
theory. Moving beyond the asymptotics of KLa n , Cator and Groeneboom 
[6] use particle processes to directly obtain the cube-root asymptotics of the 
variance of La n . Further non-asymptotic results for La n are found in [TO] . 

The related problem of the asymptotics of LI n when the sequence is drawn 
uniformly from a finite alphabet of size m has developed along parallel lines. 
Tracy and Widom [27J, as well as Johannson [18J , have shown that the lim- 
iting distribution again enjoys a direct connection to the distribution of the 
largest eigenvalue in the Gaussian Unitary Ensemble paradigm. Its, Tracy, 
and Widom [161 ELY] have further examined this problem in the inhomoge- 
neous case, relating the limiting distribution to certain direct sums of GUEs. 
In another direction, Chistyakov and Gotze [7] have pursued the two-letter 
Markov case. 

Problems from statistical physics have long inspired a lot of the research 
into these topics. Kuperberg [19], for instance, shows that certain quantum 
spin matrices are, in law, asymptotically equal to a traceless GUE matrix. 
The standard general overview of the subject of random matrices is Mehta 
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[2T], a work motivated and influenced by some of the origins of the subject 
in physics. 

While the above achievements have undoubtedly stimulated further in- 
quiry, one might still suspect that a more direct route to the limiting dis- 
tribution of LI n might be had, one whose methods reflect the essentially 
probabilistic nature of the problem. This paper proposes a step towards 
such an approach for the independent finite alphabet case, calling only upon 
some very well-known results of classical probablity theory described below. 
Indeed, the sequel will show that the limiting distribution of LI n can be con- 
structed in a most natural manner as a Brownian functional. In the context 
of random growth processes, Gravner, Tracy, and Widom [§] have already 
obtained a Brownian functional of the form we derive. This functional ap- 
peared first in the work of Glynn and Whitt [8], in queueing theory, and its 
relation to the eigenvalues of the GUE has independently been studied by 
Baryshnikov [4J. It is, moreover, remarked in [9] that the longest increas- 
ing subsequence problem could also be studied using a Brownian functional 
formulation. 

We begin our study of this problem, in the next section, by expressing 
LI n as a simple algebraic expression. Using this simple characterization, we 
then briefly determine, in Section 3, the limiting distribution of LI n (prop- 
erly centered and normalized) in the case of an m-letter alphabet with each 
letter drawn independently. Our result is expressed as a functional of an 
(m — l)-dimensional Brownian motion with correlated coordinates. Using 
certain natural symmetries, this limiting distribution is further expressed as 
various functionals of a (standard) Brownian motion. In Section 4, connec- 
tions with the Brownian functional originating with the work of Glynn and 
Whitt in queueing theory are investigated. This allows us to investigate the 
asymptotics as m grows. Section 5 is devoted to obtaining the correspond- 
ing results for countable alphabets. In Section 6, we finish the paper by 
indicating some open questions and future directions for research. 

2 Combinatorics 

Let Xi, X2, ■ ■ ■ , X n , . . . consist of a sequence of values taken from an m-letter 
ordered alphabet, aci < a.2 < ■ ■ ■ < a m . Let a r k be the number of occurrences 
of a r G {l,...,m} among X 1; X 2 , ■ ■ ■ , X k , 1 < k < n. Each increasing 
subsequence of Xi,X 2 , . . . ,X n consists simply of runs of identical values, 
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with the values of each successive run forming an increasing subsequence 
of a r . Moreover, the number of occurrences of a r G {a±, . . . , a m } among 
Xk+i, ■ ■ ■ ,Xe, where 1 < k < £ < n, is simply a\ — a r k . The length of the 
longest increasing subsequence of Xi, X 2 , . . . , X n is then given by 



LI n = max [(al - al) + (ai 2 - a 2 kl ) + ■ ■ ■ + (a™ - aZJ], (2.1) 

0<fci<- 



<fc m _i<n 



i.e., 

LI n = max [« - <) + « - <) + ■■■+ (a^ - a™ _J + a™], (2.2) 

0<«i<"- 
<k m -i<n 

where a r = 0. For i — 1, . . . , n and r — 1, . . . , m — 1, let 

{1, iiXi = a r , 

-1, ifX, = a r+1 , (2.3) 

0, otherwise, 

and let = $^£=i ^ = 1, • • • with also Sq = 0. Then clearly S k = 
a k ~ a k +1 - Hence, 

LI n = max {Sl+Sl + .-. + S^+a^}. (2.4) 

0<fei<" 
<km-l<n 

Since a£, . . . , a™ must evidently sum to k, we have 



n 



= £< 

r=l 
m— 1 

= -5>K +1 -0 + ™C 

m— 1 

= -J2 rS n + m <- 



r=l 
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Ill — J 

m m 

r=l 



Solving for a™ gives us 

r=l 

Substituting into (|2.4p . we finally obtain 

1 m—1 

L/„ = --VrS;+ max + S 2 k2 + ■ ■ ■ + S™~\}. (2.5) 



m m ^—f o<fci< 

km— 1 



' 1 <k m -i<n 



The expression (I2.5P is of a purely combinatorial nature or, in more prob- 
abilistic terms, is of a pathwise nature. We now analyze (12.51) in light of the 
probabilistic nature of the sequence X\, X 2 , ■ ■ ■ , X n . 



3 Probabilistic Development 

Throughout the sequel, Brownian functionals will play a central role. By a 
Brownian motion we shall mean an a.s. continuous, centered Gaussian process 
B(t), < t < 1, with -B(O) = 0, having stationary, independent increments. 
By a standard Brownian motion we shall mean that VaxB{t) — t, < t < 1, 
i.e., we endow C[0, 1] with the Wiener measure. A standard m- dimensional 
Brownian motion will be defined to be a vector-valued process consisting 
of m independent Brownian motions. More generally, an m- dimensional 
Brownian motion shall refer to a linear transformation of a standard Tri- 
dimensional Brownian motion. Throughout the paper, we assume that our 
underlying probability space is rich enough so that all the Brownian motions 
and sequences we study can be defined on it. 

We consider first the case in which X\, X 2 , . . . , X n , . . . are iid, with each 
letter drawn uniformly from A = {a\, . . . , a m }. Then for each fixed letter r, 
the sequence Z{, . . . Z r w . . . is also formed of iid random variables with 
P(Z[ = 1) = F(Z[ = -1) = 1/m, and P(Z[ = 0) = 1 - 2/m. 

Thus EZ[ = 0, and E(Z[) 2 = 2/m, and so, VarS 1 ^ = 2n/m, for r = 
1, 2, . . . ,ra — 1. Defining B^t) = ^=S {nt] + -^=(nt - [nt])Z{ nt]+1 , for 

< t < 1, and noting that the local maxima of B l n (t) occur at t = k/n, 
k = 0, . . . , n, we have from (12.51) that 



5 



T T — / 1 m ~ 1 

n ATT- i = E^n(l)+ max [S^) + ■ • • + K'^t^)]. (3-1) 

^2n/m rn -f-' o<ti<- 

We note that we can now invoke Donsker's Theorem since the mea- 
sures F n generated by (B^t) , . . . , B™" 1 (t)) satisfy P n (A) F^A), for 
all Borel subsets A of the space of continuous functions C([0,l] m_1 ) for 
which Foo^dA) = 0, where is the limiting (m — l)-dimensional Wiener 
measure. Then, applying Donsker's Theorem and the Continuous Map- 
ping Theorem we have that . . . , B™~ l {t)) (B l (t), . . . , B rn ~ 1 (t)), 
where the Brownian motion on the right has a covariance structure which 
we now describe. First, Cov(Z[,Z*) = KZ[Z^ = 0, for |r — s\ > 2, and 
Cov(Z[,Z[ +1 ) = EZ[Z[ +1 = -1/m, for r = 1,2, ...,m- 1. Then, as al- 
ready noted, for each fixed r, Z[,Z£, ... . . . are iid, and for fixed 

Z%, Z%, . . . , Z™~ x are dependent but identically distributed random variables. 
Moreover, it is equally clear that for any r and s, 1 <r < s <m — 1, the se- 
quences (Zl) k >i and (Z|)£>! are also identical distributions of the Z£ and that 
Z r k and Z| are independent for k ^ I. Thus, Cov(S^, S^) = nCov(Z[, Zj 5 ). 
This result, together with our 2n/m normalization factor gives the following 
covariance matrix for . . . , i? m_1 (t)): 



/ 1 "1/2 O \ 

-1/2 1 -1/2 



t 



(3.2) 



O -1/2 1 -1/2 

V -i/2 i y 

We remark here that the functional in fl3.ll) is a bounded linear functional 
on C(0, l) m_1 . (This fact will be used throughout the paper.) Hence, by a 
final application of the Continuous Mapping Theorem, 



j j I m—1 m—1 

v^7^ m 4f V ^ o< tl <... ^ KlJ K J 

v ' i=l <* ro -i<l 4=1 

We have thus obtained the limiting distribution of LI n as a Brownian 
functional. Tracy and Widom [27J already obtained the limiting distribution 
of LI n in terms of the distribution of the largest eigenvalue of the Gaus- 
sian Unitary Ensemble (GUE) of m x m Hermitian matrices having trace 
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zero. Johansson [TB] generalized this work to encompass all m eigenvalues. 
Gravner, Tracy, and Widom [H] in their study of random growth processes 
make a connection between the distribution of the largest eigenvalue in the 
mxm GUE and a Brownian functional essentially equivalent, up to a normal 
random variable, to the right hand side of (13. 3p . (This will become clear as 
we refine our understanding of (I3.3P in the sequel.) For completeness, we 
now state our result. 



Proposition 3.1 Let Xx,X%, . . . , X n , ... be a sequence of iid random vari- 
ables drawn uniformly from the ordered finite alphabet A = {aci, . . . , a m }. 
Then 



T T I 1 m—1 m—1 

Lln-n/m ]_y i§i(1) Y B\U), (3.4) 

v 1 1=1 <t m -i<i 1=1 

where (i? 1 (t), . . . , B m_1 (t)) is an {m — 1) -dimensional Brownian motion with 
covariance matrix given by (13.21) . 



For m = 2, (13.41) simply becomes 



LIn 71,2 - -~5(l) + max£(t), (3.5) 



in 2 o<t<i 

where B is standard one-dimensional Brownian motion. A well-known re- 
sult of Pitman [24J implies that, up to a factor of 2, the functional in (13.51) 
is identical in law to the radial part of a three-dimensional standard Brow- 
nian motion at time t — 1. Specifically, Pitman shows that the process 
2 maxo< s <( B(s)-B(t) is identical in law to ^(B l (t)) 2 + (B 2 (t)) 2 + (B 3 (t)) 2 , 
where (S 1 (t), B 2 (t), B 3 (t)) is a standard 3-dimensional Brownian motion. 

Let us now show that the functional in (13.51) does indeed have the same 
distribution as that of the largest eigenvalue of a 2 x 2 zero-trace matrix of 
the form 

X Y + iZ 
Y -iZ -X 



where X, Y, and Z are centered independent normal random variables, all 
with variance 1/4. These random variables have a joint density given by 
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h(x, y, z) = ^ e -^-^\ (x, y, z) G M 3 . 

It is straightforward to show that the largest eigenvalue of our matrix 
is given by Ai = \JX 2 + Y 2 + Z 2 . Thus, up to a scaling factor of 2, Ai is 
equal in law to the radial Brownian motion expression of Pitman at t — 1. 
Explicitly, since A\ 2 = AX 2 + AY 2 + AZ 2 consists of the sum of the squares of 
three iid standard normal random variables, AX 2 must have a \ 2 distribution 
with 3 degrees of freedom. Since this distribution has a density of h(x) = 
(l/\^2n)x 1 / 2 e~ x / 2 , we immediately find that Ai has density 



0(Ai) = -^(4A?) 1/2 e-^?)/ 2 (8A 1 ) 
= ^L A ?e- 2A ?, Ax > 0. 

V27T 

Let us look now at the connection between the 2x2 GUE and the traceless 
matrix we have just analyzed. Consider the 2x2 matrix 

/ X 1 Y + iZ\ 
\Y-iZ X 2 ) ' 



where Xi, X 2 , Y, and Z are independent normal random variables, with 
VarXi = VarX 2 = 1/2, and with VarY = VarZ = 1/4. Since these random 
variables have a joint density given by 

2 

conditioning on the zero-trace subspace {X\ + X 2 = 0}, and using the trans- 
formation X[ = (Xi - X 2 )/y/2 and X' 2 = (X 1 + X 2 )/y/2, we obtain the 
conditional density 

which is also the joint density of three iid centered normal random variables 
X[, Y, and Z with common variance 1/4, which we had previously obtained. 
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Let us finally note that one can directly evaluate (I3.5P in a classical man- 
ner using the Reflection Principle to obtain the corresponding density (see, 
e.g. 0[T3]). 

It is instructive to express (13.41) in terms of an (m — l)-dimensional stan- 
dard Brownian motion (B l (t), . . . , B m ~ l {t)). It is not hard to check that we 
can express B % (t), i = 1, . . . , m — 1, in terms of the B l {t) as follows: 



'B\t), i = l 



l> ° ' ~ f^f Bi (t) ~ \[^f Bl ~ 1 ( t )> 2 < i < m - 1. ( °' 0) 

Substituting (13. 6ft back into (13.41) . we obtain a more symmetric expression 
for our limiting distribution: 



LI n — n/m 1 ^-^ 
— =>- —= max y 

'n Jm o<ii<--- 



rn - 



' B\t l+l ) + J % -^B\U) 



(3.7) 

The above Brownian functional is similar to one introduced by Glynn and 
Whitt [S], in the context of a queueing problem: 

m 

D m = max V^)-^^)], (3.8) 

<f ! <f —1 1=1 

_: i-m— l_i L m — - 1 - 

where (B l (t), . . . , B m (t)) is an m-dimensional standard Brownian motion. 
Gravner, Tracy, and Widom [5], in studying a one-dimensional discrete space 
and discrete time process, have shown that its limiting distribution is equal in 
law to both D m and also to the largest eigenvalue of an m x m Hermitian 
matrix taken from a GUE. That is, D m and are in fact identical in 
law. Independently, Baryshnikov [1], studying closely related problems of 
queueing theory and of monotonous paths on the integer lattice, has shown 
that the process (D m ) m >i has the same law as the process (A^ ) m >i, where 
A[ m ' ) is the largest eigenvalue of the matrix consisting of the first m rows and 
m columns of an infinite matrix in the Gaussian Unitary Ensemble. 

Remark 3.1 It is quite clear that LI n > n/m a.s., since at least one of the 
m letters must lie on a substring of length at least n/m. Hence, the limiting 
functional in (13.41) must be supported on the positive real line. Can we see 
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directly that a.s. the functional on the right hand side of (I3.7P is also non- 
negative ? Indeed, for consider the more general Brownian functional of the 
form 




<t. 



1 — -1 
<i —1 *=1 



where < < rji, for i = 1, 2, . . . , m — 1. iVou; /or any /ized t i+ i G (0, 1], 
i = 1, . . .,m — 1, max < ii < ti+1 — r/j-B 1 ^)] is at /east as /arae as 

£ae maximum value at the two extremes, that is, when ti — or ti = U+i. 
These two values are simply j3iB l (t i+ i) and (Pi — r]i)B l (ti + i) . Since < 
A < Vi a - s -> a t least one of these two values is non-negative. Hence, we 
can successively find t m _i, t m _2, ■ ■ ■ ,t± such that each term of the functional 
is non-negative a.s. Thus the whole functional must be non-negative a.s. 
Taking $ = \/i/(i + 1) and rji = + the result holds for (13.71) . The 
functional of Glynn and Whitt in (I3.8P does not succumb to the same analysis 
since the i = 1 term demands that to = 0. 

Let us now turn our attention to the m-letter case wherein each letter 
a r G occurs with probability < p r < 1, independently, and the p r need 
not be equal as in the previous uniform case. For the non- uniform case, 
Its, Tracy, and Widom in [16] and [17] obtained the limiting distribution 
of LI n . Reordering the probabilities such that p\ > p 2 > • • • > p m , and 
grouping those probabilities having identical values pu\ of multiplicity kj, 

j = l,...,d, (so that Y^j=i^j = 171 an d Y^j=iP(j)kj = 1); they show that 
the limiting distribution is identical in law to the distribution of the largest 
eigenvalue of the direct sum of d mutually independent kj x kj GUEs, whose 



eigenvalues (Ai, A 2 , . . . , X m ) = (A* 1 , A* 1 , . . . ,X k k \, . . . , X kd , X k 2 d , X k k d d ) satisfy 



Y^iLi y/P~iK = 0- With the above ordering of the probabilities, the limiting 
distribution simplifies to a A^-fold integral involving only p\ and k\. (See 
Remark 14.41 for some explicit expressions and more details.) We now state 
our own result in terms of functionals of Brownian motion. 

Theorem 3.1 Let X\,X 2 , . . . , X n , ... be a sequence of iid random variables 
such that P(Xl = a r ) = p r , for r = l,...,m, where < p r < 1 and 
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T T 1 m ~ 1 

Pmax n 1 V"^ . Bifi\ , 

7= =^ > v »<T f B (1) + 



m— 1 



max 

0=to<ti< 

<^ 1 <^ —1 2 = 1 
_: t"m — 1 _ l m — - 1 - 



(3.9) 



w/iere p max = maxi< r < m p r; of = p r + p r+1 - (p r - p r +i) 2 , I* = {r e 
{l,...,m} : p r < Pmax}, and where (5 1 (t), . . . , -B m-1 (t)) is an (to — 1)- 
dimensional Brownian motion with covariance matrix given by 



( 1 

P2,l 



Pi, 2 Pl,3 
1 P2,3 



\Pm-l,l Pm-1,2 



Pl,m-1 \ 
P2,m-1 

1 Pm-2,m-l 
Pm-l,m-2 1 / 



with 



-Pr - 
-Ps ~ 



Mr Ms 
Mr Ms 



s = r — l, 
s = r + 1, 
|r — s| > 1 ; 



and u>rtn /i r = p r 



(TrO's 

Pr+1, 1 < T < 771 — 1. 



1 < r, s < m — 1 , 



Proof. As before, we begin with the expression for Li,,, displayed in (12. 5p . 
noting that for each letter a r , 1 < r < to — 1, i forms a sequence of iid 

random variables, and that moreover Z£ and Z\ are independent for k ^ £, 
and for any r and s. Now, however, for each fixed k, the Z r k are no longer 
identically distributed; indeed, 



Pr ■-- 



EZ[ = p r - Pr+1, 
V&rZl = p r + Pr+1 



{Pr - Pr+lf 



1 < r < to — 1, 
1 < r < to — 1. 



(3.10) 



Since < p r < 1, we have of > for all 1 < r < m — 1. We are thus led 
to define our approximation to a Brownian motion by 



Kit) :-- 



nt] 



+ (nt - [nt]) 



H+i 



a r \/n 



< t < 1, 



to — 1. 
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Again noting that the local maxima of B l n (t) occur on the set {t : t 
k/n, k = 0, . . . , n}, (I2.5p becomes 



m— 1 



LL 



n 

m m 



71 L 

1=1 

{m-1 
\a i B t n (ti)Vn + fiit, 



Next, 



^^m- 1 ^ 1 m — 1 



n— 1 m— 1 m-1 m-1 



m— 1 

7=1 7=1 j=i 7=1 j=i 

m— 1 

= J^(Pi ~ = ( X ~ P™) ~ ( m ~ 

i=l 

= 1 - mp m . 



Hence, (13.111) becomes 



LL 



n (1 — mp m )n 1 



m— 1 



m 



772 



III 



i=i 



m— 1 



+ max V" cr^^Vri + M 
0=to<ii<- f—f L 

_: I'm— 1 _ b m — -L 



n 



and, dividing through by y/n, we obtain 
LL 



= PmVn - — Y] wrA(l) 
v i=i 

771—1 

+ max V" o-jS* (tj) + fiitisfn 
o=to<ti<- f-f L 

Let t — 0, and let A; = — z = 1, . . . , m — 1. Since 

771—1 771— 1 j 771—1 771—1 771—1 

7=1 J = l 



7 Pm j i 



(3.11) 



(3.12) 



(3.13) 



7 = 1 



7 = 1 j=i 

12 



7 = 1 



(I3.13P becomes 



LI 1 T~l 

= PmVn i°iB l n (l 

Jn m z — ' 

v i=l 



{m— 1 m—l 
(TiB i n (ti) + v^y ]\(pi - Pmj { ■ 



(3.14) 



where U = J2]=i 

Setting also A m = 1— t m -\ {i.e., t m := 1), f)3.14p enjoys a more symmetric 
representation as 



m—l 



n m 



in < * 



i=l 



+ max 

A t >0 



m— 1 m 

i=i 



Let p maa; = max 1 < i < m p i . Then 



m—l 



n 



i=l 



m < * 

m—l m 



+ max 

A»>0 

ES 1 Ai=i 



2 Pmax 



i=l 



i=l 



Clearly, if A 4 > for any i such that < p ma x, then 



\ a.s. 
Pmax) OO- 



i=l 



Intuitively, then, we should demand that A, = for i G I* 
{1,2,..., m} : pi < Pmax}- Indeed, we now show that in fact 



(3.15) 



(3.16) 



{i e 



Lin Pmaxfl 



m—l 



m—l 



-— y\W% (1) + max V (nB^U) + E n , (3.17) 
m ' o=t <ti<--- 

2=1 <t^. , <^_=1 1=1 



0=t <tl<- 

<^ ! <^ — 1 2=1 
ti=ti-i, iei* 
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where the remainder term E n is a random variable converging to zero in 
probability as n — > oo. 

To see this, let us introduce the following notation. Writing 
t = (t\, tii ■ ■ ■ , t m -i), let T = {t : < t\ < ■■■ < t m _i < 1} and let 
T* = {t E T : U = U- U i E I*}. Setting C n (t) = YT=x °A{U) and 
R{t) = Y^iLii^i ~ ti-i)(Pmax ~ Pi): we can ; respectively, rewrite the maximal 
terms of flXTo]) and fETTTD as 

max [C n {t) - s/nR(t)] 

and 

maxC n (t). 

teT* 

By the compactness of T and T* and the continuity of C n (t) and R(t), we 
see that for each n and each u E tt, there is a r n E T and a r* E T* such 
that 

C n (r n ) - \fnR(r n ) = max [C n (t) - \/nR(t)] , 

and 

C n (r*) = maxC n (t). 

(Note that the piecewise-linear nature of C n (t) and the linear nature of R(t) 
imply that the arguments maximizing the above must lie on a finite set and 
that the measurablility of r n and r* is trivial.) 

Now we first claim that the set of optimizing arguments {r n }^ =1 a.s. does 
not have an accumulation point lying outside of T*. Suppose the contrary, 
namely that for each u in a set A of positive measure, there is a subsequence 
( r n k )kLi of (r n )^ =1 such that d(r nk ,T*) > e, for some e > 0, where the metric 
d is the one induced by the L^-norm over T, i.e., by ||£||oo = ^-^i<i< m -i 

Then, since T* C T, it follows that, for all n, 

C n (r n ) - V^R(r n ) > C n {r* n ), 

almost surely. Now if p ma x = Pm, then t = (0, . . . , 0) E T*, and if for some 1 < 
j < m - 1 we have p max = pj > m&x j+1 <i< m pi, then t = (0, ... ,0,1, ... ,1) E 
T* , where there are j zeros in t. Hence C nh (r* ) > C„ fc (0, . . . , 0, 1, . . . , 1) = 

Y^j+i a i^n k (1) a - s -' where the sum is taken to be zero for j = m. Given 
< 5 < 1, by the Central Limit Theorem, we can find a sufficiently negative 
real a such that 
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P(CW(r n J - V^kR(r nk ) > a) > P (C nfc «J > a) 

(m-1 
^ (7^(1) > a 
i=j+i 

> 1 - 5, 

for nfc large enough. In particular, this implies that 

F(AD{C nk (r nk ) - ^T k R(r nk ) > a}) > ±F(A), (3.18) 

for nk large enough. 

Next, note that for any t G T, we can modify its components U to obtain 
an element of T*, by collapsing certain consecutive t^s to single values, where 
% E {j — 1, j, . . . , £} and {j, j + 1, . . . , £} C J*. With this observation, it is not 
hard to see that by replacing such maximal consecutive sets of components 
{tiYi=j-\ with their median values, we must have 



\'n k n k J 



d(r nk ,T*) = max 



Writing p( 2 ) for the largest of the pi < p max , we see that for all k, and for 
almost all wG/1, 



i=l 

= J^ 7 ""* ~ ^n^iPmax ~ Pi) 

iei* 

>(p maX -P(2))J2( r n k - T n~ k 1 ) 

> 2(Pmax - P(2))d(r nk ,T*) > 2(p max -P( 2 ))e. 



Now by Donsker's Theorem and the Continuous Mapping Theorem, we 
have that 

m— 1 m— 1 

max ^ a,fi; fc (U) max ^ (t t ), 
i=i i=i 
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as rift — > oo, where . . . , B m ~ l (t)) is an (m — l)-dimensional Brownian 

motion described in greater detail below. The point here is simply that this 
limiting functional exists. Hence, given < 5 < 1, if M is chosen large 
enough, then 

P(C„ fc (r Bfc ) < M) > P (C nk {r* nk ) < M) 

( m-l \ 

> 1 - 5, 

for large enough. 

We now can see how the boundedness of R(r nk ) on A influences that of 
the whole expression C nk (r nfc ) — ^JnkR{T nk ) by the following estimates. Given 
M > as above, if /c is large enough, then 

n k > yJ(M-a + l)/(2(p max -p (2) )e), 

and also 



P {A n {C„ fc (r n J - y/^R(T nk ) < a - 1}) 

= P (A n {C„ fc (r n J < a - 1 + v^i?(r n J}) 

> P (A n {C njk (r n J < a - 1 + V^(2(p mM - p (2 ))e)}) 

>P(AnK(r n J<M}) 

But this contradicts (I3.18p . and our optimal parameter sequences (r n )^ =1 
must a.s. have their accumulation points in T*. 

Thus, given e > 0, there is an integer N £ such that the set A n>e = 
{d(r k ,T*) < e 3 ,k > n} satisfies P(A n ,e) > 1 - e, for all n > N £ . Now 
for each r n define f n 6 T* to be the (not necessarily unique) point of T* 
which is closest in the L°°-distance to r n . Recalling that 

E n = C n {r n ) - v^i?(r n ) - C n «) > 0, 

almost surely, and noting that R(t) > 0, for all t G T, we can estimate the 
remainder term £7 n as follows: for n > N e , 
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P (E n > e) = P {{E n > e} n A n , e ) + P {{E n > e} n A^ e ) 
<P({S„>e}nA», e )+F(^ e ) 

< P ({K > e} n A^) + e 

= P ({C n (r n ) - y/KR{r n ) - C n {r*) > e} n A n , e ) + e 

< P ({C„(r n ) - Vnfl(r n ) - C„(f n ) > e} n A n , e ) + e 

< P ({C n (r n ) - C n (f n ) > e} n A n , £ ) + e 

m— 1 

<IP| j^a- 4 (J3^(7^)-Ja*(f*)) >e]+r. 



(3.19) 



To further bound (I3.19p . note that for all n > 1 and all 1 < i < m — 1, 
we have Var(_B^(tj) — B l n (si)) = |tj — Sj|. Then, let (s,£) G T x T be such 
that ||i — sjloo < e 3 . Using the Bienayme-Chebyshev inequality, we find that 
for n large enough, 



P 



m— 1 



i=i 



>e <e _i (m-l) 2 max af\\t-s\ 

Ki<m—1 



2,3 



< e (m — 1) max e 

\<i<m-l ' 

= e(m — l) 2 max of 

Ki<m-1 



Since ||r n — f n || < e 3 , for n> N e , this can be used to bound (13. 19ft : 



P(|£„| >e) < 



m— 1 



i=l 



> e 



< e < (m — 1) max cr^ + 1 > . 

Ki<m— 1 



Finally, e being arbitrary, we have indeed shown that _E n — > in probability. 



Applying Donsker's Theorem, the Continuous Mapping Theorem, and 
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the converging together lemma to (I3.17P we finally have: 

m— 1 



Lin Pmax'H' 



n 



m—l 

— ia * 

m z — ' 



B\l)+ max y^aiBHti), (3.20) 

Q=t <ti<--- 



<+ i <f — 1 %— 1 



where . . . , i? m is an (m — l)-dimensional Brownian motion with 

the following covariance matrix: 



/ 1 Pl,2 Pl,3 
P2,l 1 P2,3 



Pl,m-1 \ 
P2,m-1 



: : 1 Pm-2,m-l 

\Pm-l,l Pm-l,m-2 1 / 



where 



-p r - s = r - 1, 



■i^-, It — s| > 1, Kr,s<m — 1. 

Now for t = £/n, and l<r<s<m — 1, the covariance structure above 
is computed as follows: 



\ i=l i=l 



(t £ 
i=i i=i 

— ^Cov(zr-^.,z?-^) 



na r a s * 



i=i 



no r a R 



-Cov(Z[ - /i r , Z| - ^ 



over. 



Ps 



s = r, 



t < 



-^(0- fir/Is- fl r fl s + fl r fl s ), S>r + 1, 
k ^—(-Ps - PrPs - PrPs + PrPs), S = T + 1, 
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■t*^ s > r + 1, 

a r a s 

(ps+fJ-r/J-s) 



s = r + 1, 

using the properties of the Z r k noted at the beginning of the proof. ■ 

We now study (13.91) on a case-by-case basis. First, let I* = 0, that is, let 
Pi = 1/m, for % — 1, . . . , m. Then of = 2pi = 2/m, for all i G {1,2,..., m}. 
Hence, simply rescaling (13.91) by recovers the uniform result in (|3.4p . 

Next, consider the case where p max = Pj, for precisely one j G {1, . . . , m}. 
We then have I* = {1, 2, ... , m}\{j}. This forces us to set = t Q = ti = 
■ ■ ■ = tj_i and tj = tj + i = ■ ■ ■ = t m _i = t m = 1, in the maximizing term in 
This leads to the following result. 



Corollary 3.1 If p max — Pj for precisely one j G {1, . . . , m}, then 

T j m— 1 m— 1 

— ~ P r maxn =► -- V io-^(l) + V ^(1), (3.21) 

v i=l i=j 

where the last term in (13.211) is not present if j = m. 

Remark 3.2 (i) Above, (LI n —p ma x n )/Vn converges to a centered normal 
random variable. Intuitively, this result is not surprising since the longest 
increasing subsequence is, asymptotically, a string consisting primarily of 
the most frequently occurring letter, a string whose length is approximated 
by a binomial random variable with parameters n and p max - We show be- 
low that the variance of the limiting normal distribution is, in fact, equal to 

Pmax (1 Pmax ) ■ 

(ii) One could compute the variance of the right hand side of (13.211) directly 
to verify that it is in fact p ma x(^ — Pmax)- However, the nature of the co- 
variance structure of the Brownian motion makes the calculation somewhat 
cumbersome. Instead, we revisit the appoximation to our Brownian motion 
in the first term on the right hand side of (13.211) . In doing this, we not 
only recover the variance of the limiting distribution, but also see that our 
approximating functional does indeed take the form of the sum of a binomial 
random variable and of a term which converges to zero in probability. 

Proof. We have from the very definition of the approximation to Brownian 
motion that 
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m— 1 



m—1 



1 E^(1) = - 1 E 

m ^ m ^ 

i=l i=l 



m—1 



m—1 



IfJLi 



i=l 



Recalling that — — V"* , 1 iSl 
(I3.22p becomes 



„m n_ 



and that x ip, 



1 



a? ) + — (l-mj) r . 



m 



n 



(3.22) 
1 - mp m , 

(3.23) 



Turning to the second term on the right hand side of ( 13.211) and noting 
that for 1 < j < k < m — 1, Yli=j A*i = Pj~Pk+i and that £]i=j ^ = a r — a r +1 > 
for 1 < r < n, we then have 



m—1 



m—1 m—1 
E S n ~ n E 



— [{aP n -n Pj ) 



(3.24) 



We saw in (I3.17P that we could write (LI n — p max n) / \/n, as the sum 
of a functional approximating the Brownian motion and of an error term 
E n converging, to zero, in probability. In the present case, this expression 
simplifies to 



_ m—1 m—1 j 

- - E + E + E - = Q " ~r V ° + E ^ ( 3 - 25 ) 

m L — ' z — ' \/n 

i=l i=j v 

using fl3T22l - fl3~2D . 

Now a 3 n is a binomial random variable with parameters n and p = Pj = 
Pmax- By the Central Limit Theorem and the converging together lemma, 
the right hand side of (13.251) converges to a N(0,p max (l—p max )) distribution, 
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while by Donsker's Theorem, the left hand side converges to the Brownian 
functional obtained in (I3.2ip . Hence, (LI n — p ma x n )/Vn =^ N(0,p max (l — 
Pmax)), as claimed. ■ 

Let us now study what happens when p ma x = Pj = Pk, 1 < j < k < m, 
and Pi < p max otherwise, that is, when precisely two letters have the maximal 
probability. We then have /* = {!,..., m}\{j, k}. This requires that 



= to = ti = ■ ■ ■ = tj-x, 
tj = tj+i = ■ ■ ■ = tk-i, 
tk — tk+l = • • • = t m = 1. 



Hence, 



m—1 

max > c>iB l {ti) = max 

0=to<ti<». ^ 0<t<l 

<i -i <f —1 1=1 



k— 1 m—1 

5><#(f) + Xy^ci) 

m—1 fe— 1 

= J2^(l)+m&xJ2c- l B\t). 

i=k i=j 

Thus the limiting law is 

. m—1 m—1 k—1 

- - E + E ^c 1 ) + ^ a < x i E ^(*)- ( 3 - 26 ) 

i=l i=fe i=j 

To consolidate our analysis, we treat the general case for which p max oc- 
curs exactly k times among {pi,p2, ■ ■ ■ ,Pm}, where 2 < k < m — 1. Not 
only will we recover the natural analogues of (|3.26p . but we will also express 
our results in terms of another functional of Brownian motion which is more 
symmetric. Combining the 2 < k < m — 1 case at hand with the k—1 case 
previously examined, we have the following: 



Corollary 3.2 Let p max = p h = p h = ■ ■ ■ = p jk for 1 < j x < j 2 < ■ ■ ■ < 

jk < m, for some 1 < k < m — 1, and let pi < p ma x, otherwise. Then 
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LI n ~ Pn 



n 



k 

\fPmax{^ -Pmax) HiaX IB* (t t ) - IFfa-i) 

O=to<*i<- „ L 
<*fc-i<t*=l e=1 



where the k- dimensional Brownian motion 
covariance matrix 



/l P 



V 



P = -Pmax/ (1 - Pmax). 



P 
P 



P\ 



1 P 

p V 



(3.27) 
B k (t)) has the 



(3.28) 



Proof. Let p max = p h = p h = ■ ■ ■ = p jk , with 1 < j\ < h < ■ ■ ■ < jk < ra 
and 2 < k < m - 1, i.e., l et J* = {1,2, . . .,m}\{j 1 ,j 2 , ■ ■ ■ ,3k}- Set j = 1 
and jk+i = m. Then (13.171) becomes 



LI n Pmax n 1 ■ 
7= = V ^i^n 1 

- in m *— ' 

i=i 



m— 1 



+ max VVi-B^ti) + £ n 

0=t <ti<- 

</ i —1 ? — 1 

_: i-m — 1 — - 1 - 

m— 1 



m 
i=i 



k ie+i-i 

+ n_ m < aX <-... ^fo) + 



0=t, n <t,- 1 <• 

< t .^7. n- -i^=o i=i< 



j m— 1 
■-^^(1) + 

i=i 
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+ max 



<t jk <t 3k+1 =i 



m—l 



k—1 1 m— 1 

,£=1 i=j> i=j'fe 



m—l 



1^(1) + ^(7^(1 
i=l i=7' t 



l=3k 
k-1 



+ _ max ^ J2 ai&Jth) + E n- 



(3.29) 



We immediately recognize the first term on the right hand side of (I3.29|) 
as what we encountered for k — 1. Using the definition of the B l n , f l3.29|) can 
then be rewritten as 



fc-i if+i-i 



= + max > > (TiB (tj ) + i£, 



^n* WPmax 




fc-1 



max 



E 



n o=t jo <t h <- 
<%<% +1 =i e ~ x 



+ 



7? 



max 

7lO=t Jo <tfc<- 

<t Jk <t Jk+1 =i 



fe-i 

E 



(°K]- a [lg'- n (PJ<-P««) 



flr -I TT'Prnax) , 1 ^Pmax) 

[ nt ie\ [ nt h\ 



+ E n . 



(3.30) 



Setting a^ fc+1 = n — E^ =i a « > we n °te that the random vector (a^ 1 , a^ 2 , . . . , 
an fe+1 ) follows a multinomial distribution with parameters n and (p ma x,Pmax, 
■ ■ ■ ,Pmax, 1 — kp m ax)- It is thus natural to introduce a new Brownian motion 
approximation as follows: 
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a \ , 1 TiPmax 

B e n (t) = - B =, 1 < £ < fc. (3.31) 



max 

[i - p. 

Substituting fl3T3B into fl3T30|) gives 



fc-i 

VPmax(l-Pmax) I + UiaX V [fl^) - }• + E n 

1 0=t <*l<- ~ 1 

<t k -i<t k =l 1=1 

k 

'PmaxiX-Pmax) max V ~ )] + K- (3.32) 

0=t o <*i<- ' 



By Donsker's Theorem, our approximations (B^(t), B 2 (t), . . . , B^{t)) con- 
verges jointly to a A;- dimensional Brownian motion (B 1 (t) , B 2 (t) , . . . , B k (t)). 
This Brownian motion has the covariance structure 

/l p p ■■■ p\ 

pip ; 
t ; ••. ••. ••. ; , 

: pip 

\p P V 

where p = —p m ax/{^ — Pmax), a fact which follows immediately from the 
covariance of the multinomial distribution, where the covariance of any two 
distinct a 3 / is simply — rp 2 max , for 1 < r < n. This, together with our analysis 
of the unique p m ax case, proves the corollary. ■ 



Remark 3.3 The above results provide a Brownian functional equivalent to 
the GUE result of Its, Tracy, and Widom Iffl$ (described in detail in the 
comments preceding Theorem \3.1\ and with a law given in Remark 4-4\ )- Note 
that the limiting distribution in (13.271) depends only on k and p max ; neither 
the specific values ofji, j2, ■ ■ ■ ,jk nor the remaining values ofpi are material, 
a fact already noted in JJE/- Also, it follows from generic results on Brownian 
functionals that this limiting law has a density, which in the uniform case is 
supported on the positive real line, while supported on all of M. in the non- 
uniform case. 
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We have already seen in (13.71) that the limiting distribution for the uniform 
case has a nice representation as a functional of standard Brownian motion. 
We now also express the limiting distribution in (I3.27P as a functional of 
standard Brownian motion. Moreover, this new functional extends to the 
uniform case, although its form is different from that of (13.71) . This limiting 
random variable can be viewed as the sum of a normal one and of a maximal 
eigenvalue type one. 

Corollary 3.3 Let p max = p h = p h = ■ ■ ■ = p jk , for 1 < j 1 < j 2 < ■ ■ ■ < 

jk < m ! an d some 1 < k < m, and let Pi < p max , otherwise. Then 



LI n ~ Pmax n , f VI — kPmax ~ 1 \ ^ 

— 7^— k ^ B (1) 



+ max y2[B t (t i )-B t (ti-i)]\. (3.33) 

<t k -i<t k =i 1=1 

where [B x {t\ B 2 (t), . . . , B k (t)) is a standard k- dimensional Brownian mo- 
tion. 



Proof. Let us first examine the non-uniform case 1 < k < m — 1. Recall that 
P = — Pmax /(I — Pmax)- Now the covariance matrix in (13.281) has eigenvalues 
\i = 1 — p = 1/(1— Pmax) of multiplicity k — 1 and A 2 = 1 + {k — l)p = 
(1 — kp max )/{l — Pmax) < Ai of multiplicity 1. From the symmetries of 
the covariance matrix, it is not hard to see that we can write each Brow- 
nian motion B % (t) as a linear combination of standard Brownian motions 
(S 1 (*),..., S*(t)) as follows: 



A 



where 



B\t)=PB\t)+r l i = 1,. . . ,k, (3.34) 



= (t-i)y^ + ^ v = -vor + ^ (3 35) 



Substituting (I3.34p and (13.351) into (I3.27p . and noting that (3 — 7] = \f\{ 
- Pmax, we find that 
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K 

\4w(l -Pmax) max y^ \B e (te)-B e (t e -i 
o=*o<*i<- r-f L 
<t*-i<t*=i £=1 

fc 

= VPmmll - Pmax) max /3 [-B^) - 

o=t <*i<- -f-f L 

<tfc_l<t fc =l «=1 

fc 

= VjwO- -Aw) m ax y]{(p-ri)[B e (ti)-B t (t 
O=t <ti<- L 
<*fc-i<*fe=i e=1 

k 

+ r ] J2[B i (k)-B3(t i „ 1 )]} 

3=1 

k 

= y/PmaxO- ~ Pmax) H13X { Y](/3 - T]) \B\U) - 
O=t <h<- If-* 
<t k -l<t k =l 1=1 

k k 

+ ^^[F'(t,)-F-(t H )]} 

e=i j=i 

k 

= y/Pmaxjl ~ Pmax) U]^, B 3 (l) 



3=1 

k 



+ (J3-ri) max V [B%) - B e (t t 

n— <r... < • L 



H-l 

0=t <ti<- ^— ' 
<*jfc- 1 <*fe=l £=1 

fc 



, r Vi — kp max — i \ ^ • . ^ 

vt™^| £ 2^ B ' 1 ' 



i=i 

fc 



+ max V^t,)-^^)]). (3.36) 

<*fc-i<tfc=l 

To complete the proof, we now examine the uniform case k = m, where 
necessarily p max = 1/m. Now we saw in Proposition 13.11 that 
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n ~"' m ^\ -\--Y,^\l)+ max Y,B%)V (3.37) 
Jn \ m I m z — ' o<*i<- z — ' J 

where the (m — l)-dimensional Brownian motion (B l (t), . . . , B m ~ l {t)) had a 
tridiagonal covariance matrix given by (13. 2p . Now we can derive this Brown- 
ian motion from a standard m-dimensional Brownian motion (B l (t), . . . , B m (t)) 
via the a.s. transformations 

B\t) = - B i+1 (t)), 1 < % < m - 1. 

V 2 

It is easily verified that the Brownian motion (5 1 (t), . . . , S m_1 (t)) so ob- 
tained does indeed have the covariance structure given by (13. 2p . Substitut- 
ing these independent Brownian motions into (13.361) . we obtain the following 
a.s equalities: 

r t I rW i m-l m-l 

Jn \ m I m L — ' o<ii<--- L — ' J 

t=l <t m -i<i 1=1 

-{--EWD-B' + '(D] 

m— 1 

+ max V^^)-^ 1 ^)]) 

<t m _i<i 1=1 

-{— E^w + ^w 

m 

+ max V[S*(t < )--B < (t < _i)]- J B m (l)) 
0<ti<- ^-^ J 

<t m _!<l »=1 

/"~ 1 m m 

-{ — max D^^-^^-O]}. 

ml m z — ' o<ti<- J 

(3.38) 

which we recognize as (13.331) . with k = m and p max — l/m- ■ 
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We have already seen several representations for the limiting law in the 
uniform case. Yet one more pleasing functional for the limiting distribution 
of LI n is described in the following 

Theorem 3.2 Let p max — Pi — Vi — ' ' ' = Pm — l/ m - Then 



LIn ^l^ max V \B%) - B%A := (3.39) 



^ t m — 1 — 1 ^ " 



where (B 1 ^), B 2 (t), . . . ,B m (t)) is an m- dimensional Brownian motion hav- 
ing covariance matrix (13.281) . with p — — l/(m — 1). (This Brownian motion 
satisfies J2T=i B\t) = 0, for allO<t<l.) 

Proof. We show that the functional being maximized in (I3.39P has the same 
covariance structure as the functional being maximized in (13.71) . a result 
which we restate as: 



Ll n — n/m 



Jn^^Jm" oS a < X - ^ lA^WO - ViB%)] , (3.40) 



rn - 



where j3{ = \jijii + 1) and r\i = \J {i + 1)/^. From this it will immediately 
follow that the maxima, over all < t\ < t% < • • • < i m -i < 1, in both 
expressions have the same law, clinching the proof. 

Given that the zero-sum condition on the Brownian motion is in force in 
(I3.39p . it is natural to rewrite (I3.39P as 



LI n — n/m 1 



-= max V l&iU) - B%-i)] , (3.41) 



<+ 1 <i —1 t=X 
_: i^m — l _ L m — - 1 

where (B 1 ^), B 2 (t), . . . ,B m (t)) is an m-dimensional Brownian motion with 
a permutation-invariant covariance matrix described by 



Tfi — 1 \ — / 

GoN(B\t),B\t)) ' ' 



m J m — 1 

t + ■ 

— , IT 3 
m 
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while 



„ 777 — 1 

Var£ l (t) = 1. 



m 



Let t = (ti, t 2 , • • • , tm-i) be a fixed collection of £j from the Weyl chamber 
T = {fa, t 2 , . . . , t m _!) : < ti < t 2 < • • • < t m - x < 1}. Setting 



we then have 



X t = ^2\B i {t i )-B i (t i . 1 ) 



(3.42) 



i=i 



Cov(X t ,X s 



to — 1 



m 



l<i ,j<m 



m — 1 



^ A - tj A Si_i - A + A Si_i] 



i=l 



— - ^2 A s i ~ ^ A Sj_i - A sj + A Sj-i] . (3.43) 



We can rewrite (13.431) in an especially clear way by setting T\ = [0, t\] and 
Ti = (U,t i+ i\, i = 2, ...,m, and similarly Si = [0, s x ] and Si = (sj,s i+ i], 
i — 2, . . . ,m. Letting Leb denote the Lebesgue measure on [0, 1], a case-by- 
case analysis of the relative positions of tj,tj_i,Sj, and Sj_i quickly yields 
that 



— 1 m 1 
Gov(X u X a ) = — V Leb{Ti n 3A V LebiT, n S 



TO — 1 



rn 



i=l 



Y.LebiTiHS, 



i=i 



m 
1 

m 



#3 



1 - Le6(Ti n ^ 



i=i 



1 m 

— + J^LebiTiHSi). 

i=l 



(3.44) 



From ( 13.441) we clearly have VarX t = (to — l)/m, for all t G T. To complete 
the proof, we now show that 
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m— 1 



Y t = Y,[PiB\U +1 )-r ll B\U 



(3.45) 



has the same covariance structure as X t} where $ = \fij\i + 1) and = 
+ Using the independence of the components of the Brownian 

motion, we also have 



m—l 



Cov(Y tl Y s ) = £)Cov -^(^^^(si+O -^(sO) 



i=l 
m—l 

E 

i=l 

m . ^ m—l 



z . z + 1 

— — A s i+ i) - t i+1 A Si — ti A Sj+i H — (tj A 

z + 1 % 



i + l 

t i+1 A Si - U A s i+ i — tj A Si 



i=l 



i=l 



m — l 



m—l 



2J [*t+i A Si - ^ A Si+i - 2(ti A Si)] . 



(3.46) 



i=i 



As before, a simple case-by-case analysis of the summands in (13.461) re- 
veals that 



Cov(Y t ,Y s ] 



m—l 



m 



1 - Leb ( T i n Si 



i=l 



1 m 

— + ^ Leb(Tif]Si 

m z — ^ 

i=i 



(3.47) 



completing the proof. 



4 Large-m Asymptotics and Related Results 

With the covariance structure of X t now in hand, we can compute the L 2 - 
distance between any X t and X s : 
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E{X t - X s ) 2 = VarX 4 + Var X s - 2 Cov(X t , X s 



2(1 - 1/m) - 2 



■l/m+^LebiTinSi 



i=i 



l-^Left^n^) 



i=i 



(4.1) 



Such a metric is useful, for instance, in applying Dudley's Entropy Bound to 
show that 



E | max 

0=t <ti<- 



X t < KVm - 1, 



.<tn 



l<t„ 



for some constant K not depending on m. 

We can now more clearly see the similarities between the functional D m 
of Glynn and Whitt in (13. 8p and that of (13.71) . which we have shown to 
have the same law as H m in (13.391) . Indeed, the only difference between the 
functionals is simply that in (13.81) the Brownian motions are independent, 
while in (13.391) they are subject to the zero-sum constraint. Gravner, Tracy, 
and Widom [9] have already remarked that random words could be studied 
via such Brownian functionals. In fact, a restatement of Corollary 13.31 shows 
that, in law, D m and H m differ by a centered normal random variable, as 
indicated by the next theorem and corollary. This, in turn, will allow us to 
clearly state asymptotic results for H m from the known corresponding results 
for D m . 

Theorem 4.1 Let 

m— 1 ra— 1 

+ max V B\U 



i=l 



max 

0<ti<-- 
tm— 1 * 



m > 2, and let Hi = a.s., where (i? 1 (t), . . . , B m 1 (t)) is an (m — 1)- 
dimensional Brownian motion with tridiagonal covariance matrix given by 
fl3T2D . Let 

m 

An = max y \B\U) - Bifa-i 

<t 1 <t —1 l= i 
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where (i? 1 (t), . . . , B m (t)) is a standard m- dimensional Brownian motion. 
Then D m = Z m + H m a.s., where Z m is a centered normal random vari- 
able with variance 1/m, and in fact is given by Z m = (1/m) 

Proof. The m — 1 case is trivial. For m > 2, reformulating the proof of 
Corollary 13. 3[ for the uniform case, in terms of the functionals H m and D m 
shows that 



1 



s/m \ m J 



m ' 



almost surely, and hence D m = Z m + H m a.s. 

Recalling the definition of H m from Theorem 13.21 

m 

H m := max V) \b%) - B%. x ) 
o=t <t 1 <- L 

_^<^m — 1 2z*"m — - 1 - 

where B 2 (t), . . . , B m (t)) is an m-dimensional Brownian motion having 

covariance matrix (I3.28p . with p = — l/(m — 1), i.e., ^2^LiB t (t) = 0, for all 
< t < 1, we then have 

Corollary 4.1 For each m > 1, H m — D m + Z m , where d denotes equality 
in distribution. 

Proof. Proposition 13.11 asserts that 

LI n — n/m H,, 



in 



'n \/m 
as n — > oo, while by Theorem 13.21 

LI n — n/m H„ 



'n yjm 

as n — > oo as well. The conclusion follows from the previous theorem. 



32 



This relationship between H m (resp.,H m ) and D m allows us to further 
express the limiting distribution in a rather compact form. 

Proposition 4.1 Let p max = p h = p h = ■ • • = p jk , for 1 < j x < j 2 < ■ ■ ■ < 

jk < m, and some 1 < k < m, and let pi < p ma x, otherwise. Then 



LI n - ' Pmax^ J r f\ j r? j j T 

- => VP 

max\ V n >t , max 



n 



= y/PmaxiV^ ~ kp max Z k + H k }. 

Proof. For k = m, we have p m ax = l/ m J ari d thus simply recover the 
limiting distribution H m / ^pm = H m / ^Jm of the uniform case. 

For 1 < k < m — 1, we saw in Corollary 13.31 that we could write the 
limiting law of (LI n - p max n)/ ^/n as 



/ Vl — kprnax ~ 1 \^ u j/- [ \ 

VPmax^y 7 H 1 ) 



A- 



+ max Vlfl^-B^O]}, (4.2) 

0=to<*i<- J 
<*fc-l<*fc=l £=1 

where (B l (t), B 2 (t), . . . , B k (t)) is a standard fc-dimensional Brownian mo- 
tion. But, recalling the definitions of D k and Z k , and the fact that D k = 
Z k + H k a.s., (14. 2 p becomes 



y/Pmax \ — ~T~~" + D k 



ij 



VPmax | (V 1 - ^mai ~ Z k + (Z k + H k ) j 
y/Pmax | \/l — kp max Z k + -£/fc | 

VPmax | a/ 1 - kp max Z k + H k | . (4.3) 
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Remark 4.1 One might also write the limiting law of Proposition 4-1 in 
terms of the functional D/, . Indeed, we have 



LI n - 

=> (a/ 1 - ^max - l)Z k + Dfc, 

so that the limiting law is expressed as the sum of a centered normal random 
variable and of the maximal eigenvalue of a k x k element of the GUE. 

The behavior of D m has been well-studied. In particular, it is known 
that D m /^/m -> 2 a.s. and in L 1 , as m -> oo (see |U El [121 1221 1231 126]), 
and that (D m — 2y / m)m 1 / 6 =>- i<2, as m — > 00, where F 2 is the Tracy- Widom 
distribution (see [U El [2Z1 I2"5]). From these results, the asymptotics of H m 
follows. 

Theorem 4.2 VFe /iave £/ia£ 

Hry, 



*m 

a.s. and in L l , as m — > 00. Moreover, 

H, 



- 2 m 2/3 F 2 , (4.4) 



where F 2 is the Tracy- Widom distribution. The same statements hold for H n 
in place of H m . 

Proof. From Theorem 14.11 we have D m = Z m + H m a.s., where Z m = 
Clearly, Z m — > a.s. and in L 1 . Thus, a.s. and in L 1 , 

1 . -^m 1 . Dm. 

hm — == = hm 



m-*oc ^/m m-»oo ^777, 

Since this last limit is 2, and since, for each m > 1, H m = H m , it also 
follows that 

lim E " 



m 
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We are thus left with proving the a.s. covergence to 2 of H m / ypm. Since 
the variance of the functional being maximized in the definition of H m equals 
1 — 1/m, the Gaussian concentration inequality then implies that 

-h 2 _ h 2 

¥(\H m - E# m | >h)< 2e 2 < 1 -^» < 2e~ 

for all h > 0. Then since W*H m /y/m — » 2 as m — > oo we have for m large 
enough that 



P 



H n 



-2 



> /i < P 



< P 



> \frn \ h 

\/mK 

> — 



< 2e 



This concentration result implies that 



m=l 



m 



< oo, 



m=l 



and the Borel-Cantelli lemma allows us to conclude the proof of a.s. conver- 
gence. 

Turning to the limiting law, we know ([HE]) that D m has the same dis- 
tribution as the largest eigenvalue of the mxm GUE. Then the fundamental 
random matrix theory result of Tracy and Widom [28J implies that 



D, 



ni 



2 ) m 2/3 F 2 . 



Since, moreover, D m = Z m + H m , and since Z m has variance 1/m, 
Zmin 1 / 6 =>- 0, and so 



Hm - 2\ m 2 / 3 =(£^-2\ m 2 / 3 - Z m m}/ % F,. 



ml \ \ m 



Finally, H m — H m , and the same result holds for H m in place of H n 
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Remark 4.2 (i) In the conclusion to [27^ , Tracy and Widom already de- 
rived f)4.4p by applying a scaling argument to the limiting distribution of the 
uniform alphabet case. In our case we can moreover assert that a.s. and in 
the mean, 

hm hm - = 2, 

fc->+oon-H-oc y/kp max n 



and that 

LIn Pmax^ 



2 3 



where the weak limit is first taken over n and then over k. 
(ii) Using scaling, subadditivity, and concentration arguments found in Ham- 
bly, Martin, and O'Connell fTB{ and in O 'Cornell and Yor [22], one could 
prove directly that H m / \fm — > 2 a.s. This could be accomplished by studying, 
as do these authors, a process version of H m , i.e., 



m 

H m {e):= max V - frfa-i) 

0=t <ti<-~ '—J I 



for e > 0. With obvious notations, for all e > and m > 1, D m (e) = 
Z{e) + H rn {e), a.s., where Z{e) = (1/m) YZ i B\e) . 

To see in further detail how D m and H m are related, first note that 
D m < D m+ i a.s. for m > 1, since D m can simply be obtained by restricting 
the right-most parameter t m to be 1 in the definition of D m+ i. We now show 
a stochastic domination result between D m and H m . 

Recall that a random variable X is said to stochastically dominate another 
random variable Y (i.e.,X > st Y) if for all x e M. we have P(X > x) > 
P(Y > x). 

Proposition 4.2 H m > st a/(1 — l/m)D m , form > 1. The same statement 
holds for H m in place of H m . 

Proof. Since the m = 1 case is trivial, let m > 2. We saw in (13.441) that 
the functional X t being maximized in the definition of H m had a covariance 
structure given by Cov(X t , X s ) = — 1/m + Y^lLiLeb(Ti fl Si). A similar 
argument shows that the functional U t = YlT=i [-^*(^) — which is 
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being maximized in the definition of D m has a covariance structure given by 
Cov(U t , U s ) = YZi Leb (Ti n S { ). Therefore, 

Var(V(l - l/m)U t ) = VarX t = 1 - 1/m, 

and 



Cov(V(l - l/m)U t , y/(l - l/m)U 8 ) = (1 - 1/m) ^Le6(T, n S t ) 

i=i 

> Cov(X t ,X s ). 

By Slepian's Lemma we conclude that H m > s t a/(1 — TJm)D m . The final 
assertion follows from the equality in law between H m and i7 m . ■ 

Remark 4.3 Note that 

E(X t - X s f = E(y/(l-l/m)U t - - l/m)U s ) 2 
= 2^-^Leb{T i nS i )\ 

for all s,t E [0, 1]. That is, while X t and — l/m)U t have different co- 
variance structures, their L 2 -structures are identical. The Sudakov-Fernique 
Inequality then allows us to conclude again that E_ff m = 7J)„. in a manner 
independent of the development of Theorem \4-l\ 

Remark 4.4 Let us briefly summarize the connections between random ma- 
trix theory and the Brownian functionals encountered in this paper. Writing 
x (m) _ (xi,X2, . . . ,x m ), and defining A(x) = U.i<i<j< m (xi — Xj) to be the 
Vandermonde determinant, we have the following facts. 

(i) D m = \[ m \ where is the largest eigenvalue of the m x m GUE, with 
the scaling taken to be such that the diagonal elements X^i satisfy 'EJCf i = 1, 
and the off-diagonal elements Xjj, for i ^ j, satisfy K\Xij\ 2 = 1. Using 
standard random matrix results (see, e.g., JMDj), the distribution of D m , for 
all m > 1 and all s G M, is given by 
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where 



where 



¥{D m <s)=c m I e- E -^ 2/2 A(x)W' 

J A s 



A s = {x G M m : max ar f < s}, 

KKm 



m) 



(ii^ il m = A^' "*, where A l | m ' ' ) zs £/ie largest eigenvalue of the mxm traceless 
GUE, with the scaling as in (i). Using the joint density of the eigenvalues of 
the traceless m x m GUE I21\ p27| / ; the distribution function of H m can also 
be computed directly, for all m > 2 and all s > 0, as 

¥(H m < s ) = c° m [ e~^^ x 'A(x) 2 dx^°\ 
Jao 

where dx^ m ' ^ is Lebesgue measure over the set {Y^Li x i — 0}, and where 

( m 

A° = {xeR m : max a* < si D < V" x t = 



i=l 



where 



(4)-!= / e-^ 2 ^^A(x) 2 dx^°\ 
Note that H m is a.s. non-negative, and so F(H m < s) = 0, for all s < 0. 



(Hi) J m := ^Pmax{V^- ~ kPmaxZk + H k }, the limiting functional of Proposi- 



tion 4-1 for the m-letter non-uniform case, having its most probable letters of 
multiplicity k occuring with probability p ma x, is equal in law to the sum of a 
normal random variable and a variable whose distribution, up to the scaling 
factor ^Jp m ax, is that of the largest eigenvalue of the k x k traceless GUE, 

with the scaling as in (i) and (ii). (Note also that since D m = Z m + H m , 
J m is also equal in law to the sum of a normal random variable and a vari- 
able whose distribution, up to the scaling factor Jp max , is that of the largest 
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eigenvalue of the k x k GUE.) Its, Tracy, and Widom [Tby show that, for all 
m > 2 and all s 6 1, J m has distribution given by 



where 



< s) = c k , Pmax [ e-^IS^^+i^GCtx^lA^W*), 
A s = {x G R fc : max sc* < s}, 



i<j<fc 



i _ / „-^[Etx-f+T^^(ELi^) 2 ] A(x) 2 dx «_ 



and where 



c kv = e 2 p 

Moreover, in the discussion prior to Theorem \3.1\ we noted that the k-fold 
integral representation of the limiting distribution of J m came from simplify- 
ing a more complex expression. This expression described the distribution of 
J m as that of largest eigenvalue of the direct sum of d mutually independent 
GUEs, each of size kj x kj, 1 < j < d, subject to the eigenvalue constraint 
Y^hLi \/Pi\ = 0- Here the kj were the multiplicities of the probabilities having 
common values, the pi were ordered in decreasing order, and the eigenvalues 
were ordered in terms of the GUEs corresponding to the appropriate values 
of Pi- 

Note that when k = 1, the limiting distribution becomes simply 
HJ m < s) = 1 f e-^jwCi-zw)^ 

■\J 1irp max (\ Pmax) J -co 

which is simply a iV(0,p ma:c (l — p max )) distribution. 

(iv) The Tracy-Widom distribution function F 2 , which also describes the lim- 
iting distribution of (La n — 2y / n)/n 1 / 6 , (see JB$), is given, for all t el, by 



F2(t)=6XP (~/ 



[x — t)u 2 (x)dx 



where u(x) is the solution to the Painleve II equation u xx = 2u 3 + xu with 
u(x) ~ —Ai(x), as x — > oo, where Ai is the Airy function. 
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5 Countable Infinite Alphabets 



Let us now study the problem of describing LI n for an ordered, countably 
infinite alphabet A = {a n } n >i, where ai < «2 < • • • < ot m < Let 
(Xi)f =l , Xi G A, be an iid sequence, with P(X% = a r ) = p r > 0, for r > 1. 

The central idea in the first part of our approach is to introduce two new 
sequences derived from (Xj)" =1 . Fix m > 1. The first sequence, which we 
shall term the capped sequence, is defined by taking T™ = Xi A a m , for i > 1. 
The second one, 

(y i m )i=r, the reduced sequence, consists of the subsequence 
of (Xj)™ =1 of length iV n>m , for which Xi < a m , for i > 1. Thus, the capped 
sequence (T™)™ =1 is obtained by setting to a m all letter values greater than 
a m , while the reduced sequence (Y[ a )^"( n is obtained by eliminating letter 
values greater than a m altogether. 

Let LI^ and to be the lengths of the longest increasing subse- 

quence of (7T)r=i and ( y r);=T> respectively. Now on the one hand, any 
subsequence of the reduced sequence is again a subsequence of the original 
sequence (Xj)™ =1 . On the other hand, any increasing subsequence of (Xj)" =1 
is again an increasing subsequence of the capped one. These two observations 
lead to the pathwise bounds 

Lint < LI n < LIZ*, (5-1) 

for all m > 1 and n > 1. 

These bounds suggest that the behavior of the iid infinite case perhaps 
mirrors that of the iid finite-alphabet case. Indeed, we do have the following 
result, which amounts to an extension of Theorem 13. II (or, more precisely, of 
Proposition 14.11) to the iid infinite-alphabet case. 

Theorem 5.1 Let (Xi)i>i be a sequence of iid random variables taking val- 
ues in the ordered alphabet A = {a n }n>i- Let P(Xi = <x,) = pj, for j > 1. 
Let p max = p h = Pj2 = ■■■ = Pjk> 1 < j 1 < j 2 < ■ ■ ■ < j k; k > 1, and let 
Pi < Pmax, otherwise. Then 



/= =^ V Pmax { \/i PmaxkZfr + -fffc} '.— R(p m ax> k) . 



The proof of the theorem relies on an understanding of the limiting distri- 
butions of LI™ m and LI™^. To this end, let us introduce some more notation. 
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For a finite m-alphabet, and for Vy, . . . , V n iid with P(Vi = a r ) = q r > 0, let 
LI n (q) := LI n (qi, . . . , q m ) denote the length of the longest increasing subse- 
quence of (Vi)f =1 . For each m > 1, let also 7i m = J2T=iPr- 

First, let us choose m large enough so that 1 — 7r m _i < p max - Next, 
observe that, from the capping at a m , LI^ is distributed as LI n (p), where 
p = (pi, . . . ,p m -i, 1— 7T m _i). But since m is chosen large enough, the maximal 
probability among the entries of p is then p max , of multiplicity k, as for the 
original infinite alphabet. By Theorem 13.11 we thus have 

LIn{P) ~ Pmaxn ^R( Pmax ,k), (5.2) 

asn^ oo. 

Turning to LI^ e m , suppose that the number of elements N nym of the re- 
duced subsequence (Y™)^" 1 is equal to j. Since only the elements of (Xj)™ =1 
which are at most a m are left, LI™ m must be distributed as LIj(p), where 
V = (pi/ftm, ■ ■ ■ jPrn/ftm)- From the way m is chosen, the maximal probability 
among the entries of p is then p ma x/^m, of multiplicity k. Invoking again the 
finite-alphabet result of Theorem 13.11 we find that 

LI nip) (.Pmax I "rnjH r, ( Pmax , \ /. 

R ,k , (5.3) 



jn \ 7T„ 

as n — > oo. 

We now relate the two limiting expressions in (15.21) and (15. 3ft by the 
following elementary lemma. 

Lemma 5.1 Let k > 1 be an integer, and let (q m )m=i ^ e a sequence of reals 
in (0, l/k] converging to q > 0. Then R(q m , k) =>- R(q, k), as m — > oo. 

Proof. Assume q > 0. Then 



R{q m , k) = V^m - QmkZ k + 

= y/q^W 1 - 1 kz k + H k } 



+ y/qm~{\/ 1 - Qmk - a/1 - qk}Z k 
q ^R(q,k) + c m Z k , (5.4) 
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where c m = a/1 — q m k — y/1 — qk. Since q m — > q as m — > oo, c m — > 0, and so 
c m Z fc =>- 0, as m — > oo. This gives the result. The degenerate case, q = 0, is 
clear. ■ 

The main idea developed in the proof of Theorem 15.11 is now to use the 
basic inequality (15. ip in conjunction with a conditioning argument for LI™^, 
in order to apply Lemma I5TTI i.e., to use RiPmax/^m, k) => R(p max , k), as 
m — > oo, since 7r m — > 1. 

Proof. (Theorem 15. lj) First, fix an arbitrary s > 0. As previously noted 
in Remark 13.31 R(p m ax, k) has a density supported on R (R + in the uniform 
case), and so s is a continuity point of its distribution function. Next, choose 
< ei < 1, and < 5 < 1, and again note that (1 + 6)s is also necessarily a 
continuity point for R{p max , k). 

With this choice of e x , pick (3 > such that P(Z > 0) < ex/2, where Z 
is a standard normal random variable. Finally, pick £2 such that < < 
€iF(R(p max , k) < (1 + 5)s). Such a choice of e 2 can always be made since the 
support of R(p m ax, k) includes M. + . 

We have seen that, for m large enough, we can bring some finite- alphabet 
results to bear on the infinite case. In fact, we need a few more technical 
requirements on m to complete our proof. Setting cr^ = 7T m (l — 7T m ), we 
choose large enough m so that: 

(1) 1 TT'm— 1 *^ Pmaxi 

(ii) + p maa; /5o- m /7r m )/v / 7rm - /?cr m < (1 + 5)s, and 

(iii) \¥(R( Pmax , k) < (1 + <J)s) - ^{Ripmox/^m, k) < (1 + <J)s)| < e 2 /2. 

The conditions (i) and (ii) are clearly satisfied, since 7r m — > 1 and cr m — > 0, 
as m — > oo. The condition (iii) is also satisfied, as seen by applying Lemma 
15.11 to RiPmax/^m, k), with 7r m — > 1, and since (1 + <5)s is also a continuity 
point for R(p ma x, k). 

Now recall that LI^ is distributed as LI n (p), where p = (px, ■ ■ ■ ,Pm-i, 1— 
7r m _i). Hence, we have from (15. ip and ( 15. 2ft that 



LI n ~ PmaxH , LI n ? n Pmax n „, lN , , 

< 1= => R{Pmax, k), (5.5) 



n \/n 
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and so 




— v^-V 

F{R{ Pmax ,k) < s), (5.6) 



as n — ► oo (and, in fact, for any s6l). 

More work is required to make use of the left-most minorization in (15.11) 
, Lr n e ^ < LI n .) Recall that if the length N njTn of the reduced se- 



i.e. 



quence is equal to j, then LI^^ must be distributed as LIj(p), where p 
(pi/^m, ■ ■ ■ , Pml Km)- Now the essential observation is that N nym is distributed 
as a binomial random variable with parameters 7r m and n. It is thus natural 
to focus on the values of j close to ¥,N n<m = wr m . Writing the variance of 
N n<m as ncr^, where, as above, <r^ = vr m (l — 7r m ), and 

1n,m,j := nNn,m = j) = ( " ) <(l - 7T m )"^, 



J 



we have 



E Tl . / LIn m PmaxK , . \ 

P 7= < s iV nim = j 7 n m 



j=0 



j=0 
n 



n,m,j 



^ 1 71 \J\ ~Jn 



3=0 



n ~ — ) ) ) ^n, m ,j 



3=\rm m — pcrms/n\ 

\n-K m -(3cj m ^/n\ — l 

J'=0 
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LI j{p) - 



} = \rvK m -0a my /n\ 

+ ei, (5.7) 



for sufficiently large n, where (15. 7p follows from the Central Limit Theorem 
and our choice of j3, and where, as usual, [■] is the ceiling function. 
Next, note that for \riTr m — (3a m y/n\ < j < n, and by condition (ii), 



I n I P maX I Yl ^ 

j \ \fn V 

, , n f p max ( nir m - (3a m Vn 



S + 



a/ 7r m - (3a m / y/n V % 

1 ( , PmaxPcr, 

< = = s + 



<s(l + S). (5.8) 
Hence, for sufficiently large n, we have 



vj V J V V % 



3=\rm m —l3a m y/n\ 
+ Cl 

I' an; r 



n - — 7 



ln,m,j 



< £ P ( J W *» < 8 (1 + j 7B|tBj + Cl . (5.9) 

j= \n-Km -Perm Vn} 

Now from the condition (iii), and from the weak convergence, as j — > oo, 
of (X/,-(p) - iPmax/^m)j)/V3 to R(p max / n m , k) , we find that, for j large 
enough, 
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p 



< 



'LI 3 [ 



Pmax 



P 



< (1 + 5)s ) - F(R(p max , k) < (1 + S)s) 

■3 - /- r-4 \ tth I t-. / Pma.r 



<(l + 5> -P it! =,jH < (1 + 5)5 



7I>, 



+ 



P(#C?W, fc) < (1 + 5)s) - P i? fe < (1 + 5> 



71",, 



^2 ^9 

< — + — 

2 2 

< ei P(i?(?w,fc) < (l + 5) S ), 
and so, 



(5.10) 



'Li+p) - 

I ^. < (1 + 5)5) < (1 I r t )P (/?(/,,„„,.*■) < (1 4 5).s). (5.11) 



Now since [n7r m — /3cx m A/n] — > oo, as n — > oo, with the help of (15.91) and 
(15~TT|) . (ET7|) becomes 



n 



< (1 + e 1 )P(i2(p mM ,A;) < (1 + 5)s) 

j= [n7r m -/3<j m ^/n\ 

< (1 + ei)P (i?(jw, fc) < (1 + 5)s) + ei. (5.12) 
From (15. ip we know that £i£ 6 ^ < £i n a.s., and so 



P 



LI n Pmax n , \ , / ^n,m Pmax n 
< S < P 



< s 



< (1 + Cl )P (i?(jw, fc) < (1 + 5)s) + £lj (5.13) 



for large enough n. But since ej and 5 are arbitrary, (15 . 1 3[) and (15. 6p together 
show that 
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n Pmax 



n 



< s 




F(R(p max ,k) < s) 



(5.14) 



for all s > 0. 

The proof for s < is similar. Indeed, since necessarily p max < 
R(p max ,k) describes the limiting distribution of the longest increasing sub- 
sequence for a non-uniform alphabet, and so is supported on K. One then 
needs only examine quantities of the form, e.g., F(R(p max , k) < (1 — 5)s), 
instead of F(R(p max , k) < (1 + 5)s), as we have done throughout the proof 
for s > 0. These changes lead to the resulting statement. ■ 

Remark 5.1 As an alternative to the above proof, one could certainly adopt 
the finite-alphabet development of the previous sections so as to express LI n , 
for countable infinite alphabets, in terms of approximations to functionals of 
Brownian motion. More precisely, 



where a™ counts the number of occurrences of the letter a m among (Xi)i<i< n , 
and S r k = ^2 i=1 Z\ is the sum of independent random variables defined as in 
(12.31) . After centering and normalizing the S^, as was done to obtain (13.111) 
in the non-uniform finite alphabet development, one could then try to apply 
Donsker's Theorem to obtain a Brownian functional, which we now know to 
be distributed as R(p m ax, k). 

6 Concluding Remarks 

Our development of the general finite-alphabet case leads us to consider sev- 
eral new directions in which to pursue this method and raises a number of 
interesting questions. These include the following. 
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• Extending our fixed finite-alphabet case to that of having each X n take 
values in {1, 2, . . . , m n } is an important first step. Fruitful approaches to 
such asymptotic questions would nicely close the circle of ideas initiated 
here. Such a study is already under consideration (see [14]). 

• As we have noted throughout the paper, there is a pleasing if still rather 
mysterious connection between our limiting distribution results and those 
of random matrix theory. This connection deserves to be further explored. 
Recall, for instance, Baryshnikov's observation [4J that the process D m is 
identical in law to the process A^ consisting of the largest eigenvalues of 
the m x m minor of an infinite GUE matrix. This fact is consistent with 
an interleaving-eigenvalue result from basic linear algebra, namely, that if 
Ai > A 2 > • ■ • > A n are the eigenvalues of an n x n symmetric matrix A, and 
if fMi > 1^2 > • • ■ > /i n -i ar e the eigenvalues of the matrix consisting of the 
first {n — 1) rows and columns of A, then Ai > \i\ > A 2 > ■ • ■ > /i n -i > A n . 
We thus see the consistency between the D m < D m+ i a.s. fact noted above 
and that of Ai > fix. 

• Pursuing our analysis further, one might hope to find ways in which 
we can derive the densities of our limiting functionals in a direct manner. 
Its, Tracy, and Widom [TB] have obtained clear expressions of the limiting 
distributions. While we have obtained our limiting distributions in a rather 
direct way, in turn, these densities do not clearly follow from our approach. 
This point deserves more work. 

• In another direction, our independent-letter paradigm can be extended 
to various types of dependent cases, foremost of which would be the Markov 
case. This will be presented elsewhere [15], where the framework of [13] is, 
moreover, further extended. 

• Various other types of subsequence problems can be tackled by the method- 
ologies used in the present paper. To name but a few, comparisons for 
unimodal sequences, alternating sequences, and sequences with blocks will 
deserve further similar studies. 
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